Abstract

For a $\{0,1\}$-valued matrix $M$ let $\mathrm{CC}(M)$ denote the deterministic communication complexity of the boolean function associated with $M$. The log-rank conjecture of Lovász and Saks [FOCS 1988] states that $\mathrm{CC}(M) \le \log^{c}(\mathrm{rank}(M))$ for some absolute constant $c$, where $\mathrm{rank}(M)$ denotes the rank of $M$ over the field of real numbers. We show that $\mathrm{CC}(M) \le c \cdot \mathrm{rank}(M)/\log \mathrm{rank}(M)$ for some absolute constant $c$, assuming a well-known conjecture from additive combinatorics known as the Polynomial Freiman-Ruzsa (PFR) conjecture.

Our proof is based on the study of the "approximate duality conjecture" which was recently 
suggested by Ben-Sasson and Zewi [STOC 2011] and studied there in connection to the PFR 
conjecture. First we improve the bounds on approximate duality assuming the PFR conjecture. 
Then we use the approximate duality conjecture (with improved bounds) to get the aforementioned upper bound on the communication complexity of low-rank matrices, where this part uses the methodology suggested by Nisan and Wigderson [Combinatorica 1995].

1 Introduction

This paper presents a new connection between communication complexity and additive combina- 
torics, showing that a well-known conjecture from additive combinatorics known as the Polynomial 
Freiman-Ruzsa Conjecture (PFR, in short), implies better upper bounds than currently known 
on the deterministic communication complexity of a boolean function in terms of the rank of its 
associated matrix. More precisely, our results show that the PFR Conjecture implies that every boolean function has communication complexity $O(\mathrm{rank}(M)/\log \mathrm{rank}(M))$, where $\mathrm{rank}(M)$ is the rank, over the reals, of the associated matrix. This is a weakening of a well-known conjecture in communication complexity, known as the log-rank conjecture, which postulates that every boolean function has communication complexity $\log^{O(1)} \mathrm{rank}(M)$.

Our analysis relies on the study of approximate duality, a concept closely related to the PFR 
Conjecture, which was introduced in [BZ11]. Our main technical contribution is improved bounds
on approximate duality, assuming the PFR Conjecture. We then use these bounds in order to prove 
the new upper bounds on communication complexity. 
1.1 Communication complexity and the log-rank conjecture

In the two-party communication complexity model two parties, Alice and Bob, wish to compute a function $f : X \times Y \to \{0,1\}$ on inputs $x$ and $y$, where $x$ is known only to Alice and $y$ is known only to Bob. In order to compute the function $f$ they must exchange bits of information according to some (deterministic) protocol. The (deterministic) communication complexity of a protocol is the maximum total number of bits sent between the two parties, where the maximum is taken over all pairs of inputs $x, y$. We henceforth omit the adjective "deterministic" from our discourse because our results deal only with the deterministic model. The communication complexity of the function $f$, denoted by $\mathrm{CC}(f)$, is the minimum communication complexity of a protocol for $f$.

For many applications it is convenient to associate the function $f : X \times Y \to \{0,1\}$ with the matrix $M \in \{0,1\}^{X \times Y}$ whose $(x,y)$ entry equals $f(x,y)$. For a $\{0,1\}$-valued matrix $M$, let $\mathrm{CC}(M)$ denote the communication complexity of the boolean function associated with $M$. Let $\mathrm{rank}(M)$ denote the rank of $M$ over the reals. We will occasionally consider the rank of $M$ over the two-element field $\mathbb{F}_2$ and will denote this by $\mathrm{rank}_{\mathbb{F}_2}(M)$.

It is well-known that $\log \mathrm{rank}(M) \le \mathrm{CC}(M) \le \mathrm{rank}(M)$ (cf. [KN97, Chapter 1.4]), and it is a fundamental question to determine the true dependency of $\mathrm{CC}(M)$ on the rank. The log-rank conjecture due to Lovász and Saks [LS88] postulates that the gap between the above lower bound and upper bound on $\mathrm{CC}(M)$ is polynomial rather than exponential:

Conjecture 1.1 (Log-rank). For every $\{0,1\}$-valued matrix $M$,

$$\mathrm{CC}(M) = \log^{O(1)} \mathrm{rank}(M).$$

Lovász and Saks also point out that the above conjecture has several other interesting equivalent formulations. One of them, due to Nuffelen [Nuf76] and Fajtlowicz [Faj88], is the following:

Conjecture 1.2. For every graph $G$, $\chi(G) \le \log^{O(1)} \mathrm{rank}(G)$, where $\chi(G)$ is the chromatic number of the complement of $G$, and $\mathrm{rank}(G)$ is the rank of the adjacency matrix of $G$ over the reals.

Though considerable effort has been made in an attempt to resolve this conjecture since its introduction 25 years ago, to this date not much is known about it. The best currently known upper bound on the communication complexity of a $\{0,1\}$-valued matrix $M$ with respect to its rank is $\mathrm{CC}(M) \le \log(4/3) \cdot \mathrm{rank}(M) = (0.415\ldots) \cdot \mathrm{rank}(M)$, due to Kotlov [Kot97], and it improves on the previous best bound of $\mathrm{CC}(M) \le \mathrm{rank}(M)/2$ by Kotlov and Lovász [KL96]. In the other direction, the best lower bound is $\mathrm{CC}(M) \ge \Omega(\log^{t} \mathrm{rank}(M))$ for $t = \log_3 6 = 1.631\ldots$, due to Kushilevitz (unpublished, cf. [NW95]), which improved on a previous bound with $t = \log_2 3 = 1.585\ldots$ due to Nisan and Wigderson [NW95].

Our main result is an upper bound of $O(\mathrm{rank}(M)/\log \mathrm{rank}(M))$ on the communication complexity of a $\{0,1\}$-valued matrix $M$, assuming a well-known conjecture from additive combinatorics, the Polynomial Freiman-Ruzsa conjecture, discussed next.

1.2 Additive combinatorics and the Polynomial Freiman-Ruzsa conjecture 

Quoting the (current) Wikipedia definition, additive combinatorics studies "combinatorial estimates 
associated with the arithmetic operations of addition and subtraction". As such, it deals with a 
variety of problems that aim to 'quantify' the amount of additive structure in subsets of additive 
groups. One such problem is the one addressed by the Polynomial Freiman-Ruzsa conjecture
(we shall encounter a different problem in additive combinatorics when we get to "approximate 
duality" later on). 
For $A \subseteq \mathbb{F}_2^n$, let $A + A$ denote the sum-set of $A$,

$$A + A := \{a + a' \mid a, a' \in A\},$$

where addition is over $\mathbb{F}_2$. It is easy to see that $|A + A| = |A|$ if and only if $A$ is an affine subspace of $\mathbb{F}_2^n$. The question addressed by the Freiman-Ruzsa Theorem is whether the ratio of $|A + A|$ to $|A|$ also 'approximates' the closeness of $A$ to being a subspace, or in other words, whether the fact that $A + A$ is small with respect to the size of $A$ also implies that $\mathrm{span}(A)$ is small with respect to the size of $A$. The Freiman-Ruzsa Theorem [Ruz99] says that this is indeed the case.

Theorem 1.3 (Freiman-Ruzsa Theorem [Ruz99]). If $A \subseteq \mathbb{F}_2^n$ has $|A + A| \le K|A|$, then $|\mathrm{span}(A)| \le K^2 2^{K^4} |A|$.

The above Theorem was improved in a series of works [GR06, San08, GT09] and the best currently known upper bound on the ratio $|\mathrm{span}(A)|/|A|$ is $2^{(2+o(1))K}$, due to Green and Tao [GT09]. This can be seen to be almost tight (up to the $o(1)$ term) by letting $A = \{u_1, \ldots, u_t\}$, where $u_1, u_2, \ldots, u_t \in \mathbb{F}_2^n$ are linearly independent vectors. In this case we have $|A + A| \approx \frac{t}{2}|A|$, while $|\mathrm{span}(A)| = 2^t$.
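The independent-vectors example can be checked numerically. The following is a minimal Python sketch, not part of the original argument, that assumes vectors of $\mathbb{F}_2^n$ are encoded as integers so that XOR plays the role of addition; the helper names `sumset` and `span` are illustrative.

```python
from itertools import combinations_with_replacement

def sumset(A):
    """Return A + A over F_2^n, with vectors encoded as ints (XOR is addition)."""
    return {a ^ b for a, b in combinations_with_replacement(A, 2)}

def span(A):
    """Return the F_2-linear span of A as a set of ints."""
    S = {0}
    for a in A:
        # Adding a generator doubles the span unless a is already in it.
        S |= {s ^ a for s in S}
    return S

t = 8
A = [1 << i for i in range(t)]    # t linearly independent unit vectors
AA = sumset(A)
K = len(AA) / len(A)              # doubling constant |A+A| / |A|, roughly t/2
ratio = len(span(A)) / len(A)     # |span(A)| / |A| = 2^t / t
print(len(AA), K, ratio)
```

Here $|A + A| = \binom{t}{2} + 1$, so the doubling constant grows linearly in $t$ while the span ratio grows exponentially.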

The same example also shows that the ratio $|\mathrm{span}(A)|/|A|$ must depend exponentially on $K$. However, this example does not rule out the existence of a large subset $A' \subseteq A$ for which the ratio $|\mathrm{span}(A')|/|A'|$ is just polynomial in $K$, and this is exactly what is suggested by the PFR Conjecture:

Conjecture 1.4 (Polynomial Freiman-Ruzsa (PFR)). There exists an absolute constant $r$, such that if $A \subseteq \mathbb{F}_2^n$ has $|A + A| \le K|A|$, then there exists a subset $A' \subseteq A$ of size at least $K^{-r}|A|$ such that $|\mathrm{span}(A')| \le |A|$.

Note that the above conjecture implies that $|\mathrm{span}(A')| \le |A| \le K^{r}|A'|$. The PFR conjecture has many other interesting equivalent formulations; see the survey of Green [Gre05] for some of them. It is conjectured to hold for subsets of general groups as well, and not only for subsets of the group $\mathbb{F}_2^n$, but we will be interested only in the latter case. Significant progress on this conjecture has been achieved recently by Sanders [San10], using new techniques developed by Croot and Sisask [CS10]. Sanders proved an upper bound on the ratio $|\mathrm{span}(A')|/|A'|$ which is quasi-polynomial in $K$:

Theorem 1.5 (Quasi-polynomial Freiman-Ruzsa Theorem [San10]). Let $A \subseteq \mathbb{F}_2^n$ be a set such that $|A + A| \le K|A|$. Then there exists a subset $A' \subseteq A$ of size at least $K^{-O(\log^3 K)}|A|$ such that $|\mathrm{span}(A')| \le |A|$.

Our main result is the following: 

Theorem 1.6 (Main). Assuming the PFR Conjecture 1.4, for every $\{0,1\}$-valued matrix $M$,

$$\mathrm{CC}(M) = O(\mathrm{rank}(M)/\log \mathrm{rank}(M)).$$

We end this section by mentioning two other recent applications of the PFR Conjecture to 
theoretical computer science. The first application is to the area of low-degree testing, and was 
discovered by Lovett [Lov10] and, independently, by Green and Tao [GT10]. The second application
is to the construction of two-source extractors due to Ben-Sasson and Zewi [BZ11]. The latter paper
also introduced the notion of approximate duality which plays a central role in our proof method 
so we describe it now. 
1.3 Approximate duality

Our main technical contribution is improved bounds on approximate duality, assuming the PFR 
conjecture. These new bounds lie at the heart of our proof of the Main Theorem 1.6. 

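The notion of approximate duality was first introduced in [BZ11]. For $A, B \subseteq \mathbb{F}_2^n$, we define the duality measure of $A, B$ in (1) as an estimate of how 'close' this pair is to being dual:

$$D(A,B) := \left| \mathbb{E}_{a \in A,\, b \in B}\left[(-1)^{\langle a,b\rangle_2}\right] \right|, \qquad (1)$$

where $\langle a,b\rangle_2$ denotes the binary inner product of $a, b$ over $\mathbb{F}_2$, defined by $\langle a,b\rangle_2 = \sum_{i=1}^{n} a_i \cdot b_i$, where all arithmetic operations are in $\mathbb{F}_2$.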
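To make definition (1) concrete, here is a small Python sketch of the duality measure. It assumes vectors are encoded as integer bitmasks; the function names `inner` and `duality` are illustrative and not taken from the paper.

```python
def inner(a, b):
    """Binary inner product <a, b>_2 of two bit-vectors encoded as ints."""
    return bin(a & b).count("1") % 2

def duality(A, B):
    """D(A, B) = | E_{a in A, b in B} (-1)^{<a,b>_2} |."""
    total = sum(1 - 2 * inner(a, b) for a in A for b in B)
    return abs(total) / (len(A) * len(B))

B = [0b011, 0b101]
# {000, 111} is the space dual to span(B); {001, 110} is an affine shift of it.
print(duality([0b000, 0b111], B), duality([0b001, 0b110], B))
# The full cube against itself is far from dual.
print(duality(range(8), range(8)))
```

In the first two calls every inner product takes the same value, so the measure equals 1; for the full cube only the terms with $b = 0$ survive and the measure is $8/64$.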

Remark 1.7. The duality measure can be alternatively defined as the discrepancy of the inner product function on the rectangle $A \times B$ (up to a normalization factor of $\frac{2^{2n}}{|A||B|}$). Nevertheless we chose to use the term 'duality measure' instead of 'discrepancy' because of the algebraic context in which we use it, as explained below.

It can be verified that if $D(A,B) = 1$ then $A$ is contained in an affine shift of $B^{\perp}$, which is the space dual to the linear $\mathbb{F}_2$-span of $B$. The question is what can be said about the structure of $A, B$ when $D(A,B)$ is sufficiently large, but strictly smaller than 1. The following theorem from [BZ11] says that if the duality measure is a constant very close to 1 (though strictly smaller than 1) then there exist relatively large subsets $A' \subseteq A$, $B' \subseteq B$, such that $D(A',B') = 1$.

Theorem 1.8 (Approximate duality for nearly-dual sets, [BZ11]). For every $\delta > 0$ there exists a constant $\epsilon > 0$ that depends only on $\delta$, such that if $A, B \subseteq \mathbb{F}_2^n$ satisfy $D(A,B) \ge 1 - \epsilon$, then there exist subsets $A' \subseteq A$, $|A'| \ge \frac{1}{4}|A|$ and $B' \subseteq B$, $|B'| \ge 2^{-\delta n}|B|$, such that $D(A',B') = 1$.

It is conjectured that a similar result holds also when the duality measure is relatively small, 
and in particular when it tends to zero as n goes to infinity. Furthermore, the following theorem 
from [BZ11] gives support to this conjecture, by showing that such bounds indeed follow from the
PFR conjecture. 

Theorem 1.9 (Approximate duality assuming PFR, [BZ11]). Assuming the PFR Conjecture 1.4, for every pair of constants $\alpha, \delta > 0$ there exists a constant $\zeta > 0$, depending only on $\alpha$ and $\delta$, such that the following holds. If $A, B \subseteq \mathbb{F}_2^n$ satisfy $|A|, |B| > 2^{\alpha n}$ and $D(A,B) \ge 2^{-\zeta n}$, then there exist subsets $A' \subseteq A$, $|A'| \ge 2^{-\delta n}|A|$ and $B' \subseteq B$, $|B'| \ge 2^{-\delta n}|B|$ such that $D(A',B') = 1$.

Our main technical contribution is an improved bound on approximate duality assuming the 
PFR conjecture. 

Lemma 1.10 (Main technical lemma). Suppose that $A, B \subseteq \mathbb{F}_2^n$ satisfy $D(A,B) \ge 2^{-\sqrt{n}}$. Then assuming the PFR Conjecture 1.4, there exist subsets $A', B'$ of $A, B$ respectively such that $D(A',B') = 1$, and $|A'| \ge 2^{-cn/\log n}|A|$, $|B'| \ge 2^{-cn/\log n}|B|$ for some absolute constant $c$.

Remark 1.11. The bound on $\min\left\{\frac{|A'|}{|A|}, \frac{|B'|}{|B|}\right\}$, which in the lemma above is $2^{-cn/\log n}$, cannot be improved beyond $2^{-O(\sqrt{n})}$ even if we assume $D(A,B) > 0.99$. To see this take $A = B = \binom{[n]}{c'\sqrt{n}}$ to be the set of all $\{0,1\}$-vectors with exactly $c'\sqrt{n}$ ones, where $c'$ is a sufficiently small positive constant that guarantees $D(A,B) > 0.99$. It can be verified that if $A' \subseteq A$, $B' \subseteq B$ satisfy $D(A',B') = 1$ then the smaller set is of size $2^{-\Omega(\sqrt{n})} \cdot |A|$.
The proof of Lemma 1.10 appears in Section 2. Note that this lemma improves on Theorem 1.9 in two ways. First, in the lemma the ratios $\frac{|A'|}{|A|}, \frac{|B'|}{|B|}$ are bounded from below by $2^{-cn/\log n}$, whereas in Theorem 1.9 we only get a smaller bound of the form $2^{-\delta n}$ for some constant $\delta > 0$. Note however that this improvement comes with a requirement that the duality measure $D(A,B)$ is larger: in the above lemma we require that it is at least $2^{-\sqrt{n}}$, while in Theorem 1.9 we only require it to be at least $2^{-\zeta n} \ll 2^{-\sqrt{n}}$. We note however that the bound $D(A,B) \ge 2^{-\sqrt{n}}$ can be replaced by $D(A,B) \ge \exp(-n^{1-\epsilon})$ for any $\epsilon > 0$ at the price of a larger constant $c = c(\epsilon)$. The second improvement is that the parameters in the lemma do not depend on the sizes of the sets $A$ and $B$, whereas in Theorem 1.9 they do depend on $|A|, |B|$.

These two improvements are precisely what enables us to prove the new upper bound of $O(\mathrm{rank}(M)/\log \mathrm{rank}(M))$ on the communication complexity of $\{0,1\}$-valued matrices assuming the PFR conjecture, and we go on to sketch our proof method next.

1.4 Proof overview 

First we show how our Main Theorem 1.6 is deduced from the improved bounds on approximate 
duality in Lemma 1.10. Then we give an overview of the proof of Lemma 1.10 itself. 

From approximate duality to communication complexity upper bounds. We follow the approach of Nisan and Wigderson from [NW95]. Let the size of a matrix $M$ be the number of entries in it, and if $M$ is $\{0,1\}$-valued let $\delta(M)$ denote its (normalized) discrepancy, defined as the absolute value of the difference between the fraction of zero-entries and the fraction of one-entries in $M$. Informally, discrepancy measures how "unbalanced" $M$ is, with $\delta(M) = 1$ when $M$ is monochromatic (all entries have the same value) and $\delta(M) = 0$ when $M$ is completely balanced.

Returning to the work of [NW95], they observed that to prove the log-rank conjecture it suffices to show that a $\{0,1\}$-valued matrix $M$ of rank $r$ always contains a monochromatic submatrix of size $|M|/\mathrm{q.poly}(r)$, where $\mathrm{q.poly}(r) = r^{\log^{O(1)} r}$ means quasi-polynomial in $r$. Additionally, they used spectral techniques (i.e., arguing about the eigenvectors and eigenvalues of $M$) to show that any $\{0,1\}$-valued matrix $M$ of rank $r$ contains a relatively large submatrix $M'$, of size at least $|M|/r^{3/2}$, that is somewhat biased: its discrepancy is at least $1/r^{3/2}$. We show, using tools from additive combinatorics, that $M'$ in fact contains a pretty large monochromatic submatrix (though not large enough to deduce the log-rank conjecture).

To this end we start by working over the two-element field $\mathbb{F}_2$. This seems a bit counter-intuitive because the log-rank conjecture is false over $\mathbb{F}_2$. The canonical counterexample is the inner product function $IP(x,y) = \langle x,y\rangle_2$. It is well-known (see e.g. [KN97, Chapters 1.3, 2.5]) that $\mathrm{rank}_{\mathbb{F}_2}(M_{IP}) = n$ while $\mathrm{CC}(IP) = n$. However, rather than studying $M$ over $\mathbb{F}_2$ we focus on the biased submatrix $M'$, and things change dramatically. (As a sanity check, notice that $M_{IP}$ does not contain large biased submatrices, and this does not contradict the work of [NW95] because the rank of $M_{IP}$ over the reals is $2^n - 1$.)
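The claim that the inner product matrix has rank $n$ over $\mathbb{F}_2$ can be checked for small $n$. The sketch below is illustrative only: it assumes rows are stored as integer bitmasks and uses a standard XOR-basis elimination; the names `rank_f2` and `ip_matrix_rows` are hypothetical helpers, not from the paper.

```python
def inner(a, b):
    """Binary inner product <a, b>_2 of two bit-vectors encoded as ints."""
    return bin(a & b).count("1") % 2

def rank_f2(rows):
    """Rank over F_2 of a matrix whose rows are given as int bitmasks."""
    basis = []                     # reduced rows with distinct leading bits
    for r in rows:
        for b in basis:
            r = min(r, r ^ b)      # clears b's leading bit from r if it is set
        if r:
            basis.append(r)
    return len(basis)

def ip_matrix_rows(n):
    """Rows of the 2^n x 2^n inner product matrix; bit y of row x is <x, y>_2."""
    size = 1 << n
    return [sum(inner(x, y) << y for y in range(size)) for x in range(size)]

n = 4
print(rank_f2(ip_matrix_rows(n)))
```

The row of $M_{IP}$ indexed by $x$ depends linearly on $x$ over $\mathbb{F}_2$, which is why $2^n$ rows collapse to rank $n$.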

Thus, our starting point is a large submatrix $M'$ that has large discrepancy. It is well-known that $\mathrm{rank}_{\mathbb{F}_2}(M') \le \mathrm{rank}(M') \le r$ and that this implies $M'$ can be written as $M' = A^{\top} \cdot B$, where $A, B$ are matrices whose columns are vectors in $\mathbb{F}_2^r$. Viewing each of $A, B$ as the set of its columns, we have in hand two sets that have a large duality measure as defined in (1), namely, $D(A,B) = \delta(M') \ge 1/r^{3/2}$. This is the setting in which we apply our Main Technical Lemma 1.10 and deduce that $A, B$ contain relatively large subsets $A', B'$ with $D(A',B') = 1$. One can now verify that the submatrix of $M'$ whose rows and columns are indexed by $A', B'$ respectively is indeed monochromatic, as needed. We point out that to get our bounds we need to be able to find monochromatic submatrices of $M'$ even when $M'$ is both small and skewed (i.e., has many more columns than rows or vice versa). Fortunately, Lemma 1.10 is robust enough to use in such settings.

Improved bounds on approximate duality assuming PFR. We briefly sketch the proof of 
our Main Technical Lemma 1.10. We use the spectrum of a set as defined in [TV06, Chapter 4]: 

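Definition 1.12 (Spectrum). For a set $B \subseteq \mathbb{F}_2^n$ and $\alpha \in [0,1]$ let the $\alpha$-spectrum of $B$ be the set

$$\mathrm{Spec}_{\alpha}(B) := \left\{ x \in \mathbb{F}_2^n \;:\; \left| \mathbb{E}_{b \in B}\left[(-1)^{\langle x,b\rangle_2}\right] \right| \ge \alpha \right\}. \qquad (2)$$

Notice that $A \subseteq \mathrm{Spec}_{\epsilon}(B)$ implies $D(A,B) \ge \epsilon$ (cf. (1)). In the other direction, Markov's inequality can be used to deduce that $D(A,B) \ge \epsilon$ implies the existence of $A' \subseteq A$ of relatively large size, $|A'| \ge \frac{\epsilon}{2}|A|$, such that $A' \subseteq \mathrm{Spec}_{\epsilon/2}(B)$. To prove our lemma we start with $A_1 = A'$ and establish a sequence of sets

$$A_2 \subseteq A_1 + A_1, \quad A_3 \subseteq A_2 + A_2, \quad \ldots$$

such that $A_i \subseteq \mathrm{Spec}_{\epsilon_i}(B)$ for all $i$. This holds by construction for $A_1$ with $\epsilon_1 = \epsilon/2$, and we show that it is maintained throughout the sequence for increasingly smaller values of $\epsilon_i$ (we shall use $\epsilon_{i+1} = \epsilon_i^2/2$).
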
Moving our problem from the field of real numbers to the two-element field $\mathbb{F}_2$ now pays off. Each $A_i$ is of size at most $2^n$, so there must be an index $i \le n/\log K$ for which $|A_{i+1}| \le K|A_i|$; let $t$ be the minimal such index. We use the PFR conjecture together with the Balog-Szemerédi-Gowers Theorem 2.2 from additive combinatorics to show that our assumption that $|A_{t+1}| \le K|A_t|$ implies that a large subset $A''_t$ of $A_t$ has small span (over $\mathbb{F}_2$).

We now have in hand a set $A''_t$ which is a relatively large fraction of its span and additionally satisfies $D(A''_t, B) \ge \epsilon_t$ because by construction $A''_t \subseteq \mathrm{Spec}_{\epsilon_t}(B)$. We use an approximate duality claim from [BZ11] (Lemma 2.3) which applies when one of the sets is a large fraction of its span (in our case the set which is a large fraction of its span is $A''_t$). This claim says that $A''_t$ and $B$ each contain relatively large subsets $A'_t, B'_t$ satisfying $D(A'_t, B'_t) = 1$. Finally, recalling that $A'_t$ is a (carefully chosen) subset of $A_{t-1} + A_{t-1}$, we argue that $A_{t-1}$ contains a relatively large subset $A'_{t-1}$ that is "dual" to a large subset $B'_{t-1}$ of $B'_t$, where by "dual" we mean $D(A'_{t-1}, B'_{t-1}) = 1$ (in other words, $A'_{t-1}$ is contained in an affine shift of the space dual to $\mathrm{span}(B'_{t-1})$). We continue in this manner to find pairs of "dual" subsets for $t-2, t-3, \ldots, 1$, at which point we have found a pair of "dual" subsets of $A, B$ that have relatively large size, thereby completing the proof.

Paper organization. The next section contains the proof of the Main Technical Lemma 1.10. 
The proof of Main Theorem 1.6 given the Main Technical Lemma 1.10 appears in Section 3. 

2 Improved bounds on approximate duality assuming PFR 

In this section we prove our Main Technical Lemma 1.10 by proving the following more general 
version of it. 

Lemma 2.1 (Main technical lemma, general form). Suppose that $A, B \subseteq \{0,1\}^n$ satisfy $D(A,B) \ge \epsilon$. Then assuming the PFR Conjecture 1.4, for every $K \ge 1$ and $t = n/\log K$, there exist subsets $A', B'$ of $A, B$ respectively such that $D(A',B') = 1$, and

Proof of Lemma 1.10. Follows from Lemma 2.1 by setting $K = 2^{4n/\log n}$, $t = (\log n)/4$, $\epsilon = 2^{-\sqrt{n}}$. □






Additive combinatorics preliminaries. In what follows all arithmetic operations are taken over $\mathbb{F}_2$. For the proof of Lemma 2.1 we need two other theorems from additive combinatorics. The first is the well-known Balog-Szemerédi-Gowers Theorem of [BS94, Gow98].

Theorem 2.2 (Balog-Szemerédi-Gowers). There exist fixed polynomials $f(x,y)$, $g(x,y)$ such that the following holds for every subset $A$ of an abelian additive group. If $A$ satisfies $\Pr_{a,a' \in A}[a + a' \in S] \ge 1/K$ for $|S| \le C|A|$, then one can find a subset $A' \subseteq A$ such that $|A'| \ge |A|/f(K,C)$, and $|A' + A'| \le g(K,C)|A|$.

The second is a lemma from [BZ11] which can be seen as an approximate duality statement which applies when one of the sets has small span:

Lemma 2.3 (Approximate-duality for sets with small span, [BZ11]). If $D(A,B) \ge \epsilon$, then there exist subsets $A' \subseteq A$, $B' \subseteq B$, $|A'| \ge \frac{\epsilon}{4}|A|$, $|B'| \ge \frac{\epsilon^2}{4} \cdot \frac{|A|}{|\mathrm{span}(A)|}|B|$, such that $D(A',B') = 1$. If $A \subseteq \mathrm{Spec}_{\epsilon}(B)$ then we have $|A'| \ge |A|/2$ and $|B'| \ge \epsilon^2 \frac{|A|}{|\mathrm{span}(A)|}|B|$ in the statement above.

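Recall the definition of the spectrum given in (2):

$$\mathrm{Spec}_{\alpha}(B) := \left\{ x \in \mathbb{F}_2^n \;:\; \left| \mathbb{E}_{b \in B}\left[(-1)^{\langle x,b\rangle_2}\right] \right| \ge \alpha \right\}.$$

Finally, for $S \subseteq \mathbb{F}_2^n$ and $x \in \mathbb{F}_2^n$ let $\mathrm{rep}_S(x)$ be the number of different representations of $x$ as an element of the form $s + s'$ where $s, s' \in S$. $\mathrm{rep}_S(x)$ can also be written, up to a normalization factor, as $1_S * 1_S(x)$, where $1_S$ is the indicator function of the set $S$ and $*$ denotes convolution.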
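The two notions just recalled, the spectrum and the representation count, are easy to compute by brute force for small $n$. The following Python sketch is an illustration under stated assumptions: vectors are integer bitmasks, representations are counted as ordered pairs (matching the convolution view), and the names `bias`, `spectrum` and `rep` are hypothetical.

```python
from collections import Counter

def inner(a, b):
    """Binary inner product <a, b>_2 of two bit-vectors encoded as ints."""
    return bin(a & b).count("1") % 2

def bias(x, B):
    """| E_{b in B} (-1)^{<x,b>_2} |, the quantity thresholded in (2)."""
    return abs(sum(1 - 2 * inner(x, b) for b in B)) / len(B)

def spectrum(B, alpha, n):
    """Spec_alpha(B) as a set of ints, by exhaustive search over F_2^n."""
    return {x for x in range(1 << n) if bias(x, B) >= alpha}

def rep(S):
    """Counter mapping x to the number of ordered pairs (s, s') in S x S with s + s' = x."""
    return Counter(s ^ s2 for s in S for s2 in S)

B = [0b011, 0b101]
print(sorted(spectrum(B, 1.0, 3)))   # elements with full bias against B
print(rep([1, 2, 4]))
```

For this $B$, the 1-spectrum consists of the dual space $\{000, 111\}$ together with its affine shift $\{001, 110\}$.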

Proof overview. We construct a decreasing sequence of constants

$$\epsilon_1 = \epsilon/2, \quad \epsilon_2 = \epsilon_1^2/2, \quad \epsilon_3 = \epsilon_2^2/2, \quad \ldots$$

and a sequence of sets

$$A_1 := A \cap \mathrm{Spec}_{\epsilon_1}(B), \quad A_2 \subseteq (A_1 + A_1) \cap \mathrm{Spec}_{\epsilon_2}(B), \quad A_3 \subseteq (A_2 + A_2) \cap \mathrm{Spec}_{\epsilon_3}(B), \quad \ldots$$

Since each of the sets in the sequence is of size at most $2^n$ there must be an index $i \le n/\log K$ for which

$$|A_{i+1}| \le K|A_i| \qquad (4)$$

and let $t$ be the minimal such index. The PFR Conjecture 1.4 together with the Balog-Szemerédi-Gowers Theorem 2.2 will be used to deduce from (4) that a large subset $A''_t$ of $A_t$ has small span. Applying Lemma 2.3 to the sets $A''_t$ and $B$ implies the existence of large subsets $A'_t \subseteq A_t$ and $B'_t \subseteq B$ such that $D(A'_t, B'_t) = 1$. Finally we argue inductively for $i = t-1, t-2, \ldots, 1$ that there exist large subsets $A'_i \subseteq A_i$ and $B'_i \subseteq B$ such that $D(A'_i, B'_i) = 1$. The desired conclusion will follow from the $i = 1$ case. To be able to "pull back" and construct a pair of large sets $A'_{i-1}, B'_{i-1}$ from the pair $A'_i, B'_i$ we make sure every element in $A_i$ is the sum of roughly the same number of pairs in $A_{i-1} \times A_{i-1}$.

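The sequence of sets. Let $\epsilon_1 := \epsilon/2$, $A_1 := A \cap \mathrm{Spec}_{\epsilon_1}(B)$. Assuming $A_{i-1}, \epsilon_{i-1}$ have been defined, set $\epsilon_i = \epsilon_{i-1}^2/2$ and let $j_i \in \{0, \ldots, n-1\}$ be an integer index which maximizes the size of

$$\left\{ (a,a') \in A_{i-1} \times A_{i-1} \;\middle|\; a + a' \in \mathrm{Spec}_{\epsilon_i}(B) \text{ and } 2^{j_i} \le \mathrm{rep}_{A_{i-1}}(a + a') \le 2^{j_i+1} \right\} \qquad (5)$$

and set

$$A_i := \left\{ a + a' \;:\; a, a' \in A_{i-1},\; a + a' \in \mathrm{Spec}_{\epsilon_i}(B) \text{ and } 2^{j_i} \le \mathrm{rep}_{A_{i-1}}(a + a') \le 2^{j_i+1} \right\}. \qquad (6)$$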
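One step of this construction, passing from $A_{i-1}$ to $A_i$ via (5) and (6), can be sketched in Python as follows. This is a brute-force illustration for tiny instances, assuming integer-encoded vectors and ordered-pair representation counts; `next_set` is a hypothetical name and the code is not meant as an efficient implementation.

```python
from collections import Counter

def inner(a, b):
    """Binary inner product <a, b>_2 of two bit-vectors encoded as ints."""
    return bin(a & b).count("1") % 2

def bias(x, B):
    """| E_{b in B} (-1)^{<x,b>_2} |"""
    return abs(sum(1 - 2 * inner(x, b) for b in B)) / len(B)

def next_set(A_prev, B, eps_i, n):
    """Return (A_i, j_i) built from A_{i-1} as in (5) and (6)."""
    reps = Counter(a ^ a2 for a in A_prev for a2 in A_prev)    # rep_{A_{i-1}}
    # Keep only sums that land in Spec_{eps_i}(B).
    good = {x: c for x, c in reps.items() if bias(x, B) >= eps_i}

    def pairs_in_bucket(j):
        # Number of pairs (a, a') whose sum has rep in [2^j, 2^(j+1)].
        return sum(c for c in good.values() if 2 ** j <= c <= 2 ** (j + 1))

    j_i = max(range(n), key=pairs_in_bucket)                   # maximizer of (5)
    A_i = {x for x, c in good.items() if 2 ** j_i <= c <= 2 ** (j_i + 1)}
    return A_i, j_i

A_i, j_i = next_set([1, 2, 4], [0b011, 0b101], 0.5, 3)
print(sorted(A_i), j_i)
```

The dyadic bucketing is what later guarantees that every element of $A_i$ has roughly the same number of representations as a sum of two elements of $A_{i-1}$.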






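Claim 2.4. For $i = 1$ we have $|A_1| \ge (\epsilon/2)|A|$. For $i > 1$ we have

$$\Pr_{a,a' \in A_{i-1}}\left[a + a' \in A_i\right] \ge \epsilon_i/n \qquad (7)$$

and additionally

$$|A_i| \ge \frac{\epsilon_i}{n \cdot 2^{j_i+1}} |A_{i-1}|^2. \qquad (8)$$

Proof. The case of $i = 1$ follows directly from Markov's inequality. For larger $i$ we argue that

$$\Pr_{a,a' \in A_{i-1}}\left[a + a' \in \mathrm{Spec}_{\epsilon_i}(B)\right] \ge \epsilon_i.$$

To see this use Cauchy-Schwarz to get

$$\mathbb{E}_{a,a' \in A_{i-1}} \mathbb{E}_{b \in B}\left[(-1)^{\langle a+a',b\rangle_2}\right] = \mathbb{E}_{b \in B}\left[\left(\mathbb{E}_{a \in A_{i-1}} (-1)^{\langle a,b\rangle_2}\right)^2\right] \ge \left(\mathbb{E}_{a \in A_{i-1},\, b \in B} (-1)^{\langle a,b\rangle_2}\right)^2 \ge \epsilon_{i-1}^2 = 2\epsilon_i$$

and apply Markov's inequality to deduce that an $\epsilon_i$-fraction of $(a,a') \in A_{i-1} \times A_{i-1}$ sum to an element of $\mathrm{Spec}_{\epsilon_i}(B)$. Selecting $j_i$ to maximize (5) yields inequality (7). Since every element $x \in A_i$ can be represented as $x = a + a'$ with $a, a' \in A_{i-1}$ in at most $2^{j_i+1}$ different ways, we deduce (8) from (7) and complete the proof. □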

The inductive claim. Let $t$ be the minimal index such that $|A_{t+1}| \le K|A_t|$ and note that $t \le n/\log K$ because all sets $A_i$ are contained in $\mathbb{F}_2^n$. We shall prove the following claim by backward induction.

Claim 2.5 (Inductive claim). For $i = t, t-1, \ldots, 1$ there exist subsets

$$A'_i \subseteq A_i, \quad B'_i \subseteq B$$

such that $D(A'_i, B'_i) = 1$ and $A'_i, B'_i$ are not too small:

$$|A'_i| \ge \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right) (4n)^{-(t-i)} \left(\prod_{\ell=i}^{t} \epsilon_{\ell+1}\right) |A_i|, \qquad |B'_i| \ge \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right) 2^{-(t-i)} |B|.$$

We split the proof of the claim into two parts. The base case (Proposition 2.6) is proved using the tools from additive combinatorics listed in the beginning of this section. The inductive step is proved in Proposition 2.7 using a graph construction. Before proving Claim 2.5 we show how it implies Lemma 2.1.



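Proof of Main Technical Lemma 2.1. Set $i = 1$ in Claim 2.5 above. Recall that $\epsilon_{i+1} = \epsilon_i^2/2$ for all $i$, so

$$\epsilon_{i+1} = \epsilon_1^{2^i}/2^{2^i - 1} \ge (\epsilon/2)^{2^{i+1}}.$$

Thus we have $\epsilon_{t+1} \ge (\epsilon/2)^{2^{t+1}}$ and $\prod_{\ell=1}^{t} \epsilon_{\ell+1} \ge (\epsilon/2)^{2^{t+2}}$. This gives the bounds on $A', B'$ stated in (3). □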
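The recursion $\epsilon_{i+1} = \epsilon_i^2/2$ with $\epsilon_1 = \epsilon/2$ can be checked with exact rational arithmetic. The sketch below is only a sanity check of the closed form $\epsilon_{i+1} = \epsilon_1^{2^i}/2^{2^i-1}$ and of the lower bound $(\epsilon/2)^{2^{i+1}}$; the function name `eps_sequence` and the sample value of $\epsilon$ are arbitrary choices for illustration.

```python
from fractions import Fraction

def eps_sequence(eps, t):
    """Return [eps_1, ..., eps_{t+1}] with eps_1 = eps/2 and eps_{i+1} = eps_i^2 / 2."""
    seq = [Fraction(eps) / 2]
    for _ in range(t):
        seq.append(seq[-1] ** 2 / 2)
    return seq

eps = Fraction(1, 3)
seq = eps_sequence(eps, 5)
for i, e in enumerate(seq):                    # seq[i] is eps_{i+1}
    closed_form = seq[0] ** (2 ** i) / 2 ** (2 ** i - 1)
    assert e == closed_form
    assert e >= (eps / 2) ** (2 ** (i + 1))
print([float(e) for e in seq[:3]])
```

Because the exponent doubles at every step, only $t = O(\log n)$ steps can be afforded before $\epsilon_{t+1}$ becomes too small, which is consistent with the choice of $K$ in the proof of Lemma 1.10.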

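Proposition 2.6 (Base case of Claim 2.5 ($i = t$)). There exist subsets $A'_t \subseteq A_t$, $B'_t \subseteq B$ such that $D(A'_t, B'_t) = 1$ and $A'_t, B'_t$ are not too small:

$$|A'_t| \ge \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right)|A_t|, \qquad |B'_t| \ge \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right)|B|.$$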
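Proof. By assumption $|A_{t+1}| \le K|A_t|$ and $\Pr_{a,a' \in A_t}[a + a' \in A_{t+1}] \ge \epsilon_{t+1}/n$ by (7). Hence we can apply the Balog-Szemerédi-Gowers Theorem (Theorem 2.2) to the set $A_t$ to obtain a subset $\tilde{A}_t \subseteq A_t$ such that

$$|\tilde{A}_t| \ge \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right)|A_t|,$$

and

$$|\tilde{A}_t + \tilde{A}_t| \le \mathrm{poly}\left(\frac{nK}{\epsilon_{t+1}}\right)|A_t| = \mathrm{poly}\left(\frac{nK}{\epsilon_{t+1}}\right)|\tilde{A}_t|.$$

Now we can apply the PFR Conjecture 1.4 to the set $\tilde{A}_t$, which gives a subset $A''_t \subseteq \tilde{A}_t$ such that

$$|A''_t| \ge \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right)|\tilde{A}_t|$$

and

$$|\mathrm{span}(A''_t)| \le |\tilde{A}_t| = \mathrm{poly}\left(\frac{nK}{\epsilon_{t+1}}\right)|A''_t|.$$

Recall that $A''_t \subseteq \mathrm{Spec}_{\epsilon_t}(B)$, and in particular $D(A''_t, B) \ge \epsilon_t$. Applying Lemma 2.3 to the sets $A''_t$ and $B$ we conclude that there exist subsets $A'_t \subseteq A''_t$, $B'_t \subseteq B$ such that $D(A'_t, B'_t) = 1$, and which satisfy $|A'_t| \ge \frac{1}{2}|A''_t|$ and

$$|B'_t| \ge \epsilon_t^2 \frac{|A''_t|}{|\mathrm{span}(A''_t)|}|B| \ge \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right)|B|.$$

This completes the proof of the base case. □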

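Proposition 2.7 (Inductive step of Claim 2.5). For every $i = t-1, \ldots, 1$ there exist subsets $A'_i \subseteq A_i$, $B'_i \subseteq B$ such that $D(A'_i, B'_i) = 1$ and $A'_i, B'_i$ are not too small:

$$|A'_i| \ge \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right) (4n)^{-(t-i)} \left(\prod_{\ell=i}^{t} \epsilon_{\ell+1}\right) |A_i|, \qquad |B'_i| \ge \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right) 2^{-(t-i)} |B|.$$

Proof. Suppose that the claim is true for $i$; we argue that it holds for index $i - 1$. Let $G = (A_{i-1}, E)$ be the graph whose vertices are the elements in $A_{i-1}$, and $(a,a')$ is an edge if $a + a' \in A'_i$. We bound the number of edges in this graph from below. Recall from (6) that every $a \in A'_i$ (where $A'_i \subseteq A_i$) satisfies $2^{j_i} \le \mathrm{rep}_{A_{i-1}}(a) \le 2^{j_i+1}$. Using this we get

$$|E| \ge 2^{j_i} \cdot |A'_i| \qquad \left(\mathrm{rep}_{A_{i-1}}(x) \ge 2^{j_i} \text{ for all } x \in A'_i\right)$$

$$\ge 2^{j_i} \cdot \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right) (4n)^{-(t-i)} \left(\prod_{\ell=i}^{t} \epsilon_{\ell+1}\right) |A_i| \qquad \text{(induction hypothesis)}$$

$$\ge 2 \cdot \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right) (4n)^{-(t-(i-1))} \left(\prod_{\ell=i-1}^{t} \epsilon_{\ell+1}\right) |A_{i-1}|^2 \qquad \text{(by (8))}.$$

Let $M := \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right) (4n)^{-(t-(i-1))} \left(\prod_{\ell=i-1}^{t} \epsilon_{\ell+1}\right)$. Since our graph has at least $2M|A_{i-1}|^2$ edges and $|A_{i-1}|$ vertices, it has a connected component with at least $2M|A_{i-1}|$ vertices; denote by $A''_{i-1}$ the set of vertices in it.

Choose an arbitrary element $a$ in $A''_{i-1}$. Partition $B'_i$ into two sets $B'_{i,0}$ and $B'_{i,1}$ such that all elements in $B'_{i,0}$ have inner product 0 with $a$, and all elements in $B'_{i,1}$ have inner product 1 with $a$. Let $B'_{i-1}$ be the larger of $B'_{i,0}, B'_{i,1}$, and note that $|B'_{i-1}| \ge |B'_i|/2$. Recall that our assumption was that $D(A'_i, B'_i) = 1$. Abusing notation, let $\langle A'_i, B'_i\rangle_2$ denote the value of $\langle a', b'\rangle_2$ for some $a' \in A'_i$, $b' \in B'_i$ (the choice of $a', b'$ doesn't matter because $D(A'_i, B'_i) = 1$). Next we consider two cases: the case where $\langle A'_i, B'_i\rangle_2 = 0$, and the case where $\langle A'_i, B'_i\rangle_2 = 1$.

In the first case we have that for every $a, a' \in A''_{i-1}$ which are neighbors in the graph, $a + a' \in A'_i$, and therefore $\langle a + a', b\rangle_2 = 0$ for every $b \in B'_{i-1}$. This implies in turn that $\langle a, b\rangle_2 = \langle a', b\rangle_2$ for all elements $a, a' \in A''_{i-1}$ which are neighbors in the graph and all $b \in B'_{i-1}$. Since $A''_{i-1}$ induces a connected component, and due to our choice of $B'_{i-1}$, this implies that $D(A''_{i-1}, B'_{i-1}) = 1$, so we set $A'_{i-1} = A''_{i-1}$.

In the second case we have that $\langle a + a', b\rangle_2 = 1$ for every $a, a' \in A''_{i-1}$ which are neighbors in the graph and every $b \in B'_{i-1}$. In particular this implies that $\langle a, b\rangle_2 = \langle a', b\rangle_2 + 1$ for all elements $a, a' \in A''_{i-1}$ which are neighbors in the graph and all $b \in B'_{i-1}$. This means that $A''_{i-1}$ can be partitioned into two sets $A''_{i-1,0}, A''_{i-1,1}$, where the first one contains all elements in $A''_{i-1}$ that have inner product 0 with all elements in $B'_{i-1}$, while the second set contains all elements in $A''_{i-1}$ that have inner product 1 with all elements in $B'_{i-1}$. We set $A'_{i-1}$ to be the larger of these two sets and get $D(A'_{i-1}, B'_{i-1}) = 1$ and $|A'_{i-1}| \ge M|A_{i-1}|$.

Concluding, in both cases we obtained subsets $A'_{i-1}, B'_{i-1}$ of $A_{i-1}, B$ respectively, such that $D(A'_{i-1}, B'_{i-1}) = 1$ and $A'_{i-1}, B'_{i-1}$ are not too small:

$$|A'_{i-1}| \ge \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right) (4n)^{-(t-(i-1))} \left(\prod_{\ell=i-1}^{t} \epsilon_{\ell+1}\right) |A_{i-1}|,$$

and

$$|B'_{i-1}| \ge \mathrm{poly}\left(\frac{\epsilon_{t+1}}{nK}\right) 2^{-(t-(i-1))} |B|.$$

This concludes the proof of the inductive claim. □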



3 From approximate duality to communication complexity upper bounds

In this section we prove our main theorem, Theorem 1.6, given the main technical lemma, Lemma 1.10. The proof of the main technical lemma appears in Section 2.

We start by repeating the necessary definitions. For a $\{0,1\}$-valued matrix $M$, let $\mathrm{CC}(M)$ denote the communication complexity of the boolean function associated with $M$. Let $\mathrm{rank}(M)$ and $\mathrm{rank}_{\mathbb{F}_2}(M)$ denote the rank of $M$ over the reals and over $\mathbb{F}_2$, respectively. We denote by $|M|$ the total number of entries in $M$, and by $|M_0|$ and $|M_1|$ the number of zero and non-zero entries of $M$, respectively. We say that $M$ is monochromatic if either $|M| = |M_0|$ or $|M| = |M_1|$. Finally, we define the discrepancy $\delta(M)$ of $M$ to be the ratio $\frac{\left| |M_0| - |M_1| \right|}{|M|}$.

Recall the statements of Theorem 1.6 and Lemma 1.10. 

Theorem 1.6 - Main theorem (restated). Assuming the PFR conjecture (Conjecture 1.4), for every $\{0,1\}$-valued matrix $M$,

$$\mathrm{CC}(M) = O(\mathrm{rank}(M)/\log \mathrm{rank}(M)).$$

Lemma 1.10 - Main technical lemma (restated). Suppose that $A, B \subseteq \mathbb{F}_2^n$ satisfy $D(A,B) \ge 2^{-\sqrt{n}}$. Then assuming the PFR conjecture, there exist subsets $A', B'$ of $A, B$ respectively such that $D(A',B') = 1$, and $|A'| \ge 2^{-cn/\log n}|A|$, $|B'| \ge 2^{-cn/\log n}|B|$ for some absolute constant $c$.

We first prove that the above lemma is equivalent to the following one: 

Lemma 3.1 (Main technical lemma, equivalent matrix form). Let $M$ be a $\{0,1\}$-valued matrix with no identical rows or columns, of rank at most $r$ over $\mathbb{F}_2$, and of discrepancy at least $2^{-\sqrt{r}}$. Then assuming the PFR conjecture (Conjecture 1.4), there exists a monochromatic submatrix $M'$ of $M$ of size at least $2^{-cr/\log r}|M|$ for some absolute constant $c$.

Proof. We prove only the Lemma 1.10 $\Rightarrow$ Lemma 3.1 implication. The proof of the converse implication is similar. Denote the number of rows and columns of $M$ by $k, \ell$ respectively. It is well known that the rank of $M$ over a field $\mathbb{F}$ equals $r$ if and only if $M$ can be written as the sum of $r$ rank one matrices over the field $\mathbb{F}$. Since $\mathrm{rank}_{\mathbb{F}_2}(M) \le r$ this implies in turn that there exist subsets $A, B \subseteq \mathbb{F}_2^r$, $A = \{a_1, a_2, \ldots, a_k\}$, $B = \{b_1, b_2, \ldots, b_\ell\}$ such that $M_{i,j} = \langle a_i, b_j\rangle_2$ for all $1 \le i \le k$, $1 \le j \le \ell$. Since $M$ has no identical rows or columns we know that $|A| = k$, $|B| = \ell$. Note that $D(A,B) = \delta(M) \ge 2^{-\sqrt{r}}$.

Lemma 1.10 now implies the existence of subsets $A' \subseteq A$, $B' \subseteq B$, $|A'| \ge 2^{-cr/\log r}|A|$, $|B'| \ge 2^{-cr/\log r}|B|$, such that $D(A',B') = 1$. Let $M'$ be the submatrix of $M$ whose rows and columns correspond to the indices in $A'$ and $B'$ respectively. The fact that $D(A',B') = 1$ implies that $M_{i,j} = \langle a_i, b_j\rangle_2 = \mathrm{const}$ for all $a_i \in A'$, $b_j \in B'$. Therefore $M'$ is a monochromatic submatrix of $M$ which satisfies

$$|M'| = |A'||B'| \ge 2^{-2cr/\log r}|A||B| = 2^{-2cr/\log r}|M|$$

as required. □
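The correspondence between vector sets and matrices used in this proof can be illustrated on a toy instance. The Python sketch below assumes integer-encoded vectors; `matrix_from_sets`, `discrepancy` and `duality` are illustrative helper names. It builds $M_{i,j} = \langle a_i, b_j\rangle_2$ and compares the discrepancy of $M$ with the duality measure of $(A, B)$.

```python
def inner(a, b):
    """Binary inner product <a, b>_2 of two bit-vectors encoded as ints."""
    return bin(a & b).count("1") % 2

def matrix_from_sets(A, B):
    """The {0,1}-matrix with entries M[i][j] = <a_i, b_j>_2."""
    return [[inner(a, b) for b in B] for a in A]

def discrepancy(M):
    """delta(M) = | |M_0| - |M_1| | / |M|."""
    entries = [x for row in M for x in row]
    zeros = entries.count(0)
    ones = len(entries) - zeros
    return abs(zeros - ones) / len(entries)

def duality(A, B):
    """D(A, B) = | E_{a in A, b in B} (-1)^{<a,b>_2} |."""
    total = sum(1 - 2 * inner(a, b) for a in A for b in B)
    return abs(total) / (len(A) * len(B))

A = [0b001, 0b110, 0b010]
B = [0b011, 0b101]
M = matrix_from_sets(A, B)
print(M, discrepancy(M), duality(A, B))

# Restricting to a dual pair gives a monochromatic submatrix.
A_sub = [0b001, 0b110]
print(matrix_from_sets(A_sub, B), duality(A_sub, B))
```

In this example the first two rows form an all-ones submatrix, matching $D(A', B) = 1$ for $A' = \{001, 110\}$.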

In order to prove Theorem 1.6 we follow the high-level approach of Nisan and Wigderson 
[NW95] which was explained in the previous section. They showed that in order to prove the 
log-rank conjecture it suffices to prove that every {0, l}-valued matrix of low rank has a large 
monochromatic submatrix. We start with the following lemma. 

Lemma 3.2 (Existence of large monochromatic submatrix assuming PFR). Assuming the PFR conjecture, every $\{0,1\}$-valued matrix $M$ with no identical rows or columns has a monochromatic submatrix of size at least $2^{-O(\mathrm{rank}(M)/\log \mathrm{rank}(M))}|M|$.

In order to prove the above lemma we use Lemma 3.1, together with the following theorem from 
[NW95], which says that every {0, l}-valued matrix M contains a submatrix of high discrepancy: 

Theorem 3.3 (Existence of submatrix with high discrepancy [NW95]). Every $\{0,1\}$-valued matrix $M$ has a submatrix $M'$ of size at least $(\mathrm{rank}(M))^{-3/2}|M|$ and with $\delta(M') \ge (\mathrm{rank}(M))^{-3/2}$.

Proof of Lemma 3.2. Let $r = \mathrm{rank}(M)$. Theorem 3.3 implies the existence of a submatrix $M'$ of $M$ with $|M'| \ge r^{-3/2}|M|$, and $\delta(M') \ge r^{-3/2} \ge 2^{-\sqrt{r}}$. Note also that

$$\mathrm{rank}_{\mathbb{F}_2}(M') \le \mathrm{rank}(M') \le \mathrm{rank}(M) = r.$$

Lemma 3.1 then implies the existence of a monochromatic submatrix $M''$ of $M'$ of size at least $2^{-cr/\log r}|M'|$ for some absolute constant $c$. So we have that $M''$ is a monochromatic submatrix of $M$ which satisfies

$$|M''| \ge 2^{-cr/\log r}|M'| \ge 2^{-cr/\log r} r^{-3/2}|M| = 2^{-O(r/\log r)}|M|.$$

□

Proof of Theorem 1.6. Let $M$ be a $\{0,1\}$-valued matrix. We will construct a deterministic protocol for $M$ with communication complexity $O(\mathrm{rank}(M)/\log \mathrm{rank}(M))$. We may assume w.l.o.g. that $M$ has no repeated rows or columns; otherwise we can eliminate the repeated row or column, and the protocol we construct for the "compressed" matrix (with no repeated rows/columns) will also be a protocol for $M$.

We follow the high level approach of the proof of Theorem 2 from [NW95]. We will show a protocol with $2^{O(r/\log r)}$ leaves, where $r = \mathrm{rank}(M)$. This will suffice since it is well-known that a protocol with $t$ leaves has communication complexity at most $O(\log t)$ (cf. [KN97, Chapter 2, Lemma 2.8]).

Now we describe the protocol. Let $Q$ be the largest monochromatic submatrix of $M$. Then $Q$ induces a natural partition of $M$ into 4 submatrices $Q, R, S, T$ with $R$ sharing the rows of $Q$ and $S$ sharing the columns of $Q$:

$$M = \begin{pmatrix} Q & R \\ S & T \end{pmatrix}$$

Let $U_1$ be a subset of the rows of $(Q|R)$ whose restriction to the columns of $R$ span the rows of $R$. Similarly, let $U_2$ be a subset of the rows of $(S|T)$ whose restriction to the columns of $S$ span the rows of $S$. Note that if $Q$ is the all zeros matrix then the rows of $U_1$ are independent of the rows of $U_2$. Otherwise, if $Q$ is the all ones matrix then the rows of $U_1$ are independent of all the rows of $U_2$ except possibly for the vector in $U_2$ whose restriction to the columns of $S$ is the all ones vector (if such a vector exists). Thus since $Q$ is monochromatic we have that $\mathrm{rank}(R) + \mathrm{rank}(S) = |U_1| + |U_2| \le \mathrm{rank}(M) + 1$.

If $\mathrm{rank}(R) \le \mathrm{rank}(S)$ then the row player sends a bit saying whether his input belongs to the rows of $Q$ or not. The players continue recursively with a protocol for the submatrix $(Q|R)$ or the submatrix $(S|T)$ according to the bit sent. If $\mathrm{rank}(R) > \mathrm{rank}(S)$ the roles of the row and column players are switched.

Suppose without loss of generality that $\mathrm{rank}(R) \le \mathrm{rank}(S)$. Then after sending one bit we continue with either the matrix $(Q|R)$, which is of rank at most $\mathrm{rank}(M)/2$, or with the matrix $(S|T)$, which (thanks to Lemma 3.2) is of size at most $(1 - \delta)|M|$ for $\delta \ge 2^{-cr/\log r}$.

Let $L(m,r)$ denote the number of leaves in the protocol starting with a matrix of area at most $m$ and rank at most $r$. Then we get the following recurrence relation:

$$L(m,r) \le \begin{cases} L(m, r/2) + L(m(1-\delta), r) & r > 1 \\ 1 & r = 1 \end{cases}$$

It remains to show that in the above recursion $L(m,r) = 2^{O(r/\log r)}$. Applying the recurrence iteratively $1/\delta$ times to the right-most summand we get

$$L(m,r) \le \delta^{-1} L(m, r/2) + L(m(1-\delta)^{1/\delta}, r) \le 2^{cr/\log r} L(m, r/2) + L(m/2, r).$$

Set $A(m,r) := 2^{-2cr/\log r} L(m,r)$. Then we have $A(m,r) \le A(m,r/2) + A(m/2,r)$, which together with $A(1,r), A(m,1) \le 1$ imply $A(m,r) \le \binom{\log m + \log r}{\log r}$, since we may apply the recursion iteratively at most $\log r$ times to the left term and $\log m$ times to the right term before we reach $A(1,r)$ or $A(m,1)$. This in turn implies $A(m,r) \le \binom{\log m + \log r}{\log r} \le r^{O(\log r)}$ due to the fact that $r \le m \le 2^{2r}$, since we may assume there are no identical rows or columns in the matrix $M$.

Concluding, we have $L(m,r) \le 2^{2cr/\log r + O(\log^2 r)}$, which implies in turn $\mathrm{CC}(M) = O(r/\log r)$ as claimed. □
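The binomial bound used for the normalized recurrence can be verified mechanically. The sketch below works in logarithmic coordinates, writing $m = 2^i$ and $r = 2^j$ (an assumption made purely for illustration), and treats the inequality $A(m,r) \le A(m,r/2) + A(m/2,r)$ with boundary values 1 as an equality, which gives the worst case. The function name `a` is hypothetical.

```python
from functools import lru_cache
from math import comb

@lru_cache(maxsize=None)
def a(i, j):
    """Worst case of A(2^i, 2^j) under A(m, r) <= A(m, r/2) + A(m/2, r), boundary value 1."""
    if i == 0 or j == 0:
        return 1
    return a(i, j - 1) + a(i - 1, j)

# The solution is Pascal's triangle: a(i, j) = C(i + j, j).
assert all(a(i, j) == comb(i + j, j) for i in range(12) for j in range(12))
print(a(10, 3), comb(13, 3))
```

With $i = \log m \le 2r$ and $j = \log r$, the binomial coefficient is at most $(2r + \log r)^{\log r} = r^{O(\log r)}$, which is negligible next to the $2^{2cr/\log r}$ factor.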